SemanticScuttle - klotz.me » Tags: simon willison+llm

Tags: simon willison* + llm*


Long context support in LLM 0.24 using fragments and template plugins

LLM 0.24 introduces fragments and template plugins to make better use of long-context models. The release improves storage efficiency and enables new features such as querying logs by fragment and making use of documentation. The post also details improvements to template handling and model support.

2025-04-08 Tags: llm, context, simon willison by klotz

Qwen2.5-VL-32B: Smarter and Lighter

A review of the Qwen2.5-VL-32B vision-language model, covering its performance, capabilities, and how it runs on a 64GB Mac. Includes a demonstration with a map image and performance statistics.

2025-03-26 Tags: vision, llm, qwen, simon willison by klotz

Here’s how I use LLMs to help me write code

Simon Willison describes how he uses Large Language Models (LLMs) for coding, with detailed advice on augmenting coding abilities, setting reasonable expectations, managing context, and more.

2025-03-12 Tags: llm, coding, ai-assisted programming, simon willison by klotz

How To Use LLMs For Programming Tasks

A guide on using large language models (LLMs) for programming tasks, including examples, strategies, and useful tips for effectively using AI assistants like ChatGPT and Claude.

2025-03-12 Tags: hackaday, llm, simon willison, programming by klotz

Claude 3.7 Sonnet, extended thinking and long output, llm-anthropic 0.14

Simon Willison discusses the release of llm-anthropic 0.14, which adds support for Claude 3.7 Sonnet's new features. Key features include extended thinking mode, a massive increase in output limits, and improved support for long tasks. The article also covers the plugin's implementation details and limitations.

2025-02-25 Tags: claude, claude 3.7 sonnet, llm-anthropic, llm, simon willison by klotz

OpenAI reasoning models: Advice on prompting

OpenAI's documentation for their o1 and o3 'reasoning models' includes tips on how to best prompt them, such as using developer messages, delimiters, and specific instructions.

2025-02-03 Tags: llm, prompting, simon willison, openai by klotz

Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens

Alibaba's Qwen 2.5 LLM now supports context lengths of up to 1 million tokens using Dual Chunk Attention. Two models have been released on Hugging Face; both require significant VRAM to run at full capacity. The post also discusses deployment challenges with quantized GGUF versions and system resource constraints.

2025-01-28 Tags: qwen2.5-1m, alibaba, hugging face, gguf, llm, simon willison by klotz

My AI/LLM Predictions for the Next 1, 3, and 6 Years, for Oxide and Friends

Simon Willison shares his predictions regarding the development of AI and LLMs over the next 1, 3, and 6 years. He discusses the potential failure of AI agents to fully realize their expected capabilities, the success of coding and research assistants, a Pulitzer prize for AI-assisted investigative reporting within three years, the emergence of privacy laws, the creation of amazing art in six years, and concerns about AGI/ASI leading to mass civil unrest.

2025-01-10 Tags: llm, oxide, agents, simon willison by klotz

Things we learned about LLMs in 2024

A review of advancements and key themes in Large Language Models over the course of 2024, including the GPT-4 barrier being broken, reduced costs, multimodal capabilities, and more.

2025-01-01 Tags: llm, simon willison, 2024 by klotz

files-to-prompt

Concatenate a directory full of files into a single prompt for use with LLMs

2024-12-12 Tags: files-to-prompt, llm, simon willison, github by klotz
